(57) Abstract: A protein lattice (1) having a regular structure with a repeating unit repeating in three dimensions may have many
uses, for example to support an array of macromolecular entities for X-ray crystallography. The repeating unit comprises protein
protomers (2) which each comprise at least two monomers (5, 6) fused together. The monomers (5, 6) are each monomers of a 
respective oligomer assembly (3, 4) into which the monomers are assembled for assembly of the protomers into the lattice. The 
first oligomer assembly (3) has a set of rotational symmetry axes extending in three dimensions. In said protomers (2), further 
monomers (6) fused to said first monomers (5) are monomers of respective further oligomer assemblies (4) which have a rotational 
symmetry axis of the same order as a respective one of said set of rotational symmetry axes of said first oligomer assembly (3). Thus, 
the repeating unit includes protomers (2) with the first monomers (5) of the protomers (2) being assembled into said first oligomer 
assembly (3) and, in respect of respective ones of said set of rotational symmetry axes, with further monomers (6) of the protomers (2) 
fused to respective first monomers (5) being assembled into respective further oligomer assemblies (4). As a result of the symmetry
of the oligomer assemblies (3, 4) said rotational symmetry axis of said respective further oligomer assemblies (4) is aligned with the 
respective rotational symmetry axis of said first oligomer assembly (3). Thus, an N-fold fusion between the oligomer assemblies (3, 
4) is produced and the rotational symmetry axes of the oligomer assemblies (3, 4) define the symmetry of the lattice. 
PROTEIN LATTICE

The present invention relates to protein lattices having a regular structure 
repeating in three dimensions. The protein lattices are based on symmetrical oligomer 
assemblies capable of self-assembly from the monomers of the oligomer assembly. Such 
protein lattices may have pores with dimensions of the order of nanometres to hundreds of
nanometres. As such, the protein lattices are nanostructures which have many potential 
uses, for example as a matrix to support macromolecular entities for X-ray crystallography. 

WO-00/68248 discloses regular protein structures based on symmetrical 
oligomer assemblies capable of self-assembly. In particular, WO-00/68248 discloses 
structures formed from protein protomers (referred to as a "fusion protein" in
WO-00/68248) comprising at least two monomers (referred to as "oligomerization domains" in
WO-00/68248) which are each monomers of a respective symmetrical oligomer assembly. 
Self-assembly of the monomers into the oligomer assembly causes assembly of the regular 
structures themselves. Several different types of structures are disclosed, including 
discrete structures and structures extending in one, two and three dimensions.

In WO-00/68248, the relative orientations of the monomers within the protomers 
are selected to provide the desired regular structure upon self-assembly. The monomers 
are fused together through a rigid linking group which is carefully selected to provide the 
requisite relative orientation of the monomers in the protomer. For example, in the 
laboratory production reported in WO-00/68248, the selection of the protomer was
performed using a computer program to model monomers connected by a linking group in
the form of a continuous, intervening alpha-helical segment over a range of incrementally 
increased lengths. Thus, the lattices suggested in WO-00/68248 having a regular structure 
repeating in three dimensions are formed from protomers comprising two monomers of 
respective dimeric or trimeric oligomer assemblies which are symmetrical about a single
rotational axis. The relative orientation of the two monomers is selected to provide a
specific angle of intersection between the rotational symmetry axes of the two oligomer
assemblies. Thus, there is a single fusion between the two oligomer assemblies and the
relative orientation of the oligomer assemblies is controlled by careful selection of the
linking group providing the fusion.

WO-00/68248 only reports laboratory production of protein structures of a 
discrete cage and a filament extending in one dimension. It is expected that application of
the teaching of WO-00/68248 to protein lattices repeating in three dimensions would 
encounter the following difficulties. Firstly, it is expected that there would be a difficulty 
in design arising from the requirement to select the relative orientation of the monomers 
within the protomer appropriate for constructing a lattice. This would probably reduce the 
numbers of types of oligomer assembly available to form a protein lattice, and hence make 
it difficult to identify suitable proteins. Secondly, it is expected that practical difficulties 
would be encountered during assembly. The structures disclosed in WO-00/68248 rely on 
the rigidity of the fusion between monomers in protomers which forms the single fusion 
between oligomer assemblies. WO-00/68248 teaches that the relative orientation of the 
monomers in the protomers controls the relative orientation of the oligomer assemblies in 
the resultant structure, so it is expected that flexing of the fusion away from the desired 
relative orientation would reduce the reliability of self-assembly. It is expected that such a 
problem would become more acute as the size of the repeating unit increases, thereby 
providing a practical restriction on the reliable production of lattices with relatively large
pore sizes.

Accordingly, it would be desirable to provide protein lattices having a different 
type of structure in which these expected problems might be alleviated. 

According to a first aspect of the present invention, there is provided a protein
lattice having a regular structure with a repeating unit repeating in three dimensions, the 
repeating unit comprising protein protomers which each comprise at least two monomers 
fused together, the monomers each being monomers of a respective oligomer assembly into 
which the monomers are assembled for assembly of the protomers into the lattice, wherein 
the repeating unit comprises protomers comprising at least a first monomer which is a 
monomer of a first oligomer assembly which has a quaternary structure which is 
symmetrical in three dimensions. 

As a result of using at least a first oligomer assembly which is symmetrical in 
three dimensions, the structure of the repeating unit and hence the protein lattice is derived 
from the symmetry of the oligomer assembly. In particular, it is not dependent on the 
relative orientation of the monomers within the protomer. Therefore, protein lattices in 
accordance with the present invention may be designed by selecting oligomer assemblies
wherein at least the first oligomer assembly has an appropriate three-dimensional symmetry
to build a lattice repeating in three dimensions. Protomers are then produced comprising
monomers of the selected oligomer assemblies fused together. Subsequently, the 
protomers are allowed to self-assemble under suitable conditions. As described in more 
detail below, the chosen symmetries of the oligomer assemblies cause the protomers to 
self-assemble into the protein lattice.

To assist in understanding, reference is made to Fig. 1 which illustrates a 
particular example of a protein lattice 1 in accordance with the present invention, as 
described in more detail below. In particular, the protein lattice 1 has a regular structure 
with a repeating unit comprising a first oligomer assembly 3 which is symmetrical in three 
dimensions, which in this example is human heavy chain ferritin which has octahedral
symmetry. Each of the monomers 5 of the first oligomer assembly 3 is fused to a further
monomer 6 of a further oligomer assembly 4, which in this example is E. coli PurE, which has
symmetry belonging to the dihedral D4 point group. The further monomers 6 are
assembled into the further oligomer assemblies 4 arranged with their rotational symmetry
axes of order 4 aligned along the rotational symmetry axes of order 4 of the first oligomer
assembly 3. Thus, the symmetry of the repeating unit, and hence the symmetry of the 
protein lattice 1, is the same as the symmetry of the set of rotational symmetry axes of 
order 4, as will be described in more detail below. 

Accordingly, the present invention involves the use of a different class of 
oligomer assemblies from that used in WO-00/68248 and provides the benefit that one is
not restricted by the selection of the relative orientation of the monomers within the
protomer. Thus it is expected that the design of protein lattices will be assisted in that the
relative orientation of the monomers within the protomer is a less critical constraint.
Similarly, it is expected that more reliable assembly of the protein lattices will be possible,
as described in more detail below.

According to other aspects of the present invention, there is provided an 
individual protomer or plural protomers capable of self-assembly to form such a protein 
lattice, as well as polynucleotides encoding such protomers, vectors and host cells capable 
of expressing such protomers and methods of making the protomers.

The present invention will now be described in more detail by way of non-limitative
example with reference to the accompanying drawings in which:

Fig. 1 is a diagram schematically illustrating, for a first protein lattice, the design 
of a homologous protomer based on two oligomer assemblies and production of the lattice
itself; 

Fig. 2 is a diagram schematically illustrating, for a second protein lattice, the 
design of two heterologous protomers based on three oligomer assemblies and production
of the lattice itself; and

Fig. 3 is a picture of an experimentally produced protein lattice of the type
illustrated in Fig. 1.

Protein lattices in accordance with the present invention may be designed by 
selecting oligomer assemblies, at least a first of which is symmetrical in three dimensions,
which when fused together produce a repeating unit which is capable of repeating in three
dimensions. As the symmetry of the repeating unit, and hence the lattice as a whole, 
depends on the symmetry of the oligomer assemblies, this involves a selection of oligomer 
assemblies having a quaternary structure which provides appropriate symmetries. This is a 
straightforward task, because the symmetries of oligomer assemblies are generally 
available in the scientific literature on proteins, for example from The Protein Data Bank;
H. M. Berman, J. Westbrook, Z. Feng, G. Gilliland, T. N. Bhat, H. Weissig, I. N. 
Shindyalov & P. E. Bourne; Nucleic Acids Research, 28 pp. 235-242 (2000) which is the 
single worldwide archive of structure data of biological macromolecules, also available 
through websites such as http://www.rcsb.org.

In some lattices, the repeating unit repeats in the same orientation across the
lattice. In other lattices two or more adjacent repeating units together form a unit cell
which repeats in the same orientation across the lattice, but with the repeating units within 
a unit cell arranged in different orientations. 

Examples of oligomer assemblies which produce lattices with a repeating unit 
repeating in three dimensions are given below.

Advantageously, the first oligomer assembly has a quaternary structure with a set 
of rotational symmetry axes extending in three dimensions. As a result, said repeating unit 
includes protomers with the first monomers of the protomers being assembled into said 
first oligomer assembly and, in respect of respective ones of said set of rotational 
symmetry axes, with further monomers of the protomers fused to respective first
monomers being arranged symmetrically around said respective one of said set of 
rotational symmetry axes.

The arrangement of the repeating unit, and hence the lattice as a whole, is
therefore dependent on the symmetries of the first oligomer assembly. In particular, in the 
assembled first oligomer assembly, inevitably and by definition, there are groups of first 
monomers arranged symmetrically around each of the set of rotational symmetry axes of 
the first oligomer assembly. This is because the symmetry results from the identical 
monomers being so arranged around the rotational symmetry axes. 

Since the further monomers are each fused to a respective first monomer, it 
follows that groups of the further monomers are also arranged symmetrically around each 
of the set of rotational symmetry axes. The further monomers are held in this symmetrical 
arrangement by being attached to first monomers in the first oligomer assembly. These 
groups of symmetrically arranged further monomers fused to the first oligomer assembly 
self-assemble with other monomers (which may be corresponding further monomers of 
another repeating unit, or may be monomers in a different part of the same unit cell) to 
form further oligomer assemblies, which are also arranged symmetrically around the set of 
rotational symmetry axes of the first oligomer assembly. 

Thus, the arrangement of the repeating unit, and hence the lattice as a whole, is 
dependent on the symmetries of the first oligomer assembly, not on the relative orientation 
of the monomers within an individual protomer. In other words, the present invention 
provides the advantage that the three dimensional structure of the protein lattice may be 
based solely on the symmetries of the oligomer assemblies. This provides advantages in
the design of the protein lattices. That is to say, the design of the repeating unit, and hence
the lattice as a whole, may be based on the symmetries of the oligomer assemblies, which
makes it easier to select appropriate oligomer assemblies for use in the protein lattice.

Desirably, the first oligomer assembly has a quaternary structure with a set of 
rotational symmetry axes extending in three dimensions, and, in said protomers, further 
monomers fused to said first monomers are monomers of respective further oligomer 
assemblies which have a rotational symmetry axis of the same order as a respective one of 
said set of rotational symmetry axes of said first oligomer assembly. As a result, said 
repeating unit includes protomers with the first monomers of the protomers being 
assembled into said first oligomer assembly and, in respect of respective ones of said set of 
rotational symmetry axes, with further monomers of the protomers fused to respective first 
monomers being assembled into respective further oligomer assemblies with said rotational 
symmetry axis of said respective further oligomer assemblies being aligned with the
respective rotational symmetry axis of said first oligomer assembly. 

The arrangement of the repeating unit and hence the lattice as a whole are 
therefore dependent on the symmetries of the first and further oligomer assemblies. In 
particular, as described above, in the first oligomer assembly there are groups of first 
monomers arranged symmetrically around each of the set of rotational symmetry axes,
which in turn result in groups of the further monomers fused to the first monomers also 
being arranged symmetrically around each of the set of rotational symmetry axes of the 
first oligomer assembly. These groups of symmetrically arranged further monomers fused 
to the first oligomer assembly self-assemble with other monomers to form the further 
oligomer assembly. The other monomers may be further monomers of another repeating
unit or may be monomers in a different part of the same repeating unit.

As a result of the further monomers fused to the first oligomer assembly being 
arranged symmetrically around a rotational symmetry axis of the first oligomer assembly, 
it follows that the further oligomer assembly is held with the group of fused further 
monomers also held symmetrically around that rotational symmetry axis of the first 
oligomer assembly. However, inevitably and by definition, the further monomers also 
assemble in the further oligomer assembly in a symmetrical arrangement around the 
rotational symmetry axis of the further oligomer assembly. Thus, the result of the further 
oligomer assembly having a rotational symmetry axis of the same order as one of the set of 
rotational symmetry axes of the first oligomer assembly is that the first and further 
oligomer assemblies assemble with their symmetry axes aligned with one another. It 
follows from the symmetry of both oligomer assemblies that this is the most stable 
arrangement. This results in an N-fold fusion between the first and further oligomer 
assemblies, where N is a plural number equal to the order of the respective rotational 
symmetry axis of the first oligomer assembly and the rotational symmetry axis of the 
further oligomer assembly. In each of the first and further oligomer assemblies, there are N 
monomers arranged around the rotational symmetry axis, each of the monomers being 
fused within a respective protomer to a monomer of the other oligomer assembly. 

Thus the set of rotational symmetry axes does not include all the rotational 
symmetry axes of the first oligomer assembly. Rather the set comprises the rotational 
symmetry axes of the first oligomer assembly which are of the same order as rotational 
symmetry axes of the further oligomer assembly. For example, in Fig. 1,
the set of rotational symmetry axes of the first oligomer assembly 3 are the rotational 
symmetry axes of order 4, rather than those of order 3 or 2, due to the further oligomer 
assembly 4 having rotational symmetry axes of order 4. Further examples are given below.

The particular choice of symmetries of the first and further oligomer assemblies
results, on assembly of the protomers into the lattice, in the oligomer assemblies being
built up with their rotational symmetry axes aligned. This means that the arrangement of
the repeating unit, and hence the lattice as a whole, is controlled by the symmetries of the
first and further oligomer assemblies, not by the relative orientation of the monomers
within an individual protomer. In other words, the present invention provides the
advantage that the three-dimensional structure of the protein lattice may be based solely on
the symmetries of the oligomer assemblies. This is advantageous in the design of the
protein lattice. By basing the three-dimensional structure of the repeating unit, and hence
the lattice as a whole, on the symmetries of the oligomer assemblies, it is easier to select
appropriate oligomer assemblies to form a lattice. During design, the relative orientation
of the monomers within an individual protomer in its unassembled form becomes a much
weaker constraint than is present in, for example, WO-00/68248.
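
By way of illustration only, the symmetry-matching step of the design may be sketched as follows. The sketch assumes that each oligomer assembly is described simply by a point group label ("T" for tetrahedral, "O" for octahedral, "Dn" for dihedral of order n); the labels and function names are hypothetical and are not taken from the specification. It lists the axis orders N shared by a first and a further oligomer assembly, which are the candidates for an N-fold fusion. Which candidate is actually usable also depends on the location of the termini, which this sketch does not model.

```python
# Orders of the rotational symmetry axes present in the platonic point groups
# used in the examples. These are standard facts about the rotation groups.
PLATONIC_AXIS_ORDERS = {
    "T": {3, 2},     # tetrahedral: 3-fold and 2-fold axes
    "O": {4, 3, 2},  # octahedral: 4-fold, 3-fold and 2-fold axes
}


def axis_orders(group):
    """Return the set of rotational axis orders for 'T', 'O' or 'Dn'."""
    if group in PLATONIC_AXIS_ORDERS:
        return set(PLATONIC_AXIS_ORDERS[group])
    if group.startswith("D") and group[1:].isdigit():
        n = int(group[1:])
        if n < 2:
            raise ValueError("dihedral order must be at least 2")
        # One principal n-fold axis plus perpendicular 2-fold axes.
        return {n, 2}
    raise ValueError("unsupported point group: " + group)


def candidate_fusion_orders(first, further):
    """Axis orders N common to both assemblies, highest order first."""
    return sorted(axis_orders(first) & axis_orders(further), reverse=True)


print(candidate_fusion_orders("O", "D4"))  # [4, 2]
print(candidate_fusion_orders("T", "D3"))  # [3, 2]
print(candidate_fusion_orders("T", "D4"))  # [2]
```

For the example of Fig. 1 (octahedral first assembly, D4 further assembly) the highest shared order is 4, which is the order of the set of rotational symmetry axes used there.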

There are also expected to be advantages during self-assembly of the lattice. In 
particular, the formation of an N-fold fusion between two given oligomer assemblies 
results in the bond between the two oligomer assemblies being relatively rigid. This is
expected to reduce relative motion of the oligomer assemblies during the assembly process.
This is expected to assist in reliable formation of the lattice with the oligomer assemblies in 
the correct relative positions. 

Although there are particular advantages in the use of a further oligomer 
assembly which has a rotational symmetry axis of the same order as the rotational
symmetry axes of the first oligomer assembly, this is not essential. Alternatively, it would
be possible for the further monomers arranged symmetrically around the rotational
symmetry axes of the first oligomer assembly to be monomers of separate oligomer
assemblies, for example of dimeric oligomer assemblies (being heterologous or
homologous). In that case, the further oligomer assembly would effectively be replaced by
a group of separate dimeric oligomer assemblies, equal in number to the order of the 
rotational symmetry axis of the first oligomer assembly, with the separate dimeric oligomer 
assemblies held around the rotational symmetry axis of the first oligomer assembly in an
arrangement which might or might not have the N-fold symmetry of the rotational 
symmetry axis of the first oligomer assembly. 

The form and production of the protomers will now be described. Except that
the present invention involves protomers in which a different choice of monomers from
WO-00/68248 are fused together, the form and production of the protomers per se, as well
as the polynucleotide encoding the protomers, may be the same as disclosed in
WO-00/68248, which is therefore incorporated herein by reference.

The nature of the monomers themselves will now be described. 

The monomers are monomers of oligomer assemblies which are capable of
self-assembly under suitable conditions to produce a protein lattice. The secondary and tertiary
structure of the monomers is unimportant in itself provided they assemble into a
quaternary structure with the required symmetry. However, it is advantageous if the
protein is easily expressed and folded in a heterologous expression system (for example
using a plasmid expression vector in E. coli).

The monomers may be naturally occurring proteins, or may be modified by 
peptide elements being absent from, substituted in, or added to a naturally occurring 
protein provided that the modifications do not substantially affect the assembly of the 
monomers into their respective oligomer assembly. Such modifications are in themselves
known for a number of different purposes which may be applied to monomers of the
present invention. In other words, the monomer may be a homologue and/or fragment 
and/or fusion protein of a naturally occurring protein. 

The monomer may be chemically modified, e.g. post-translationally modified. 
For example, it may be glycosylated or comprise modified amino acid residues. 

The monomers are preferably fused genetically, although in principle other
fusions are possible such as chemical fusions.

Although the monomers may be fused directly together, preferably the 
monomers are fused by a linking group of peptide or non-peptide elements. In general, 
linking two proteins by a linking group is known for other purposes and such linking
groups may be applied to the present invention.

Another factor in the selection of appropriate oligomer assemblies is the location 
and orientation of (a) the termini of the first monomers when arranged in the first oligomer 
assembly in its natural form (i.e. not fused to a further oligomer assembly) and (b) the
termini of the further monomers when arranged in the further oligomer assembly in its 
natural form (i.e. not fused to the first oligomer assembly). Such information on the 
arrangement of the termini in the oligomer assembly in its natural form is generally 
available for oligomer assemblies, for example from The Protein Data Bank referred to 
above. Ideally, these termini should have the same separation and orientation, because 
they will be fused together in the assembled protein lattice to constitute the N-fold fusion 
arranged symmetrically around a rotational symmetry axis. That being said, it is not 
essential for the separation and orientation to be the same, because any difference may be 
accommodated by deformation of the monomers near the N-fold fusion and/or by use of a 
linking group. Therefore, as a general point, oligomer assemblies should be chosen in
which the termini of both oligomer assemblies which are to be fused together in an N-fold
fusion allow formation of the fusion without preventing assembly of the oligomer
assemblies and hence the protein lattice.
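
As a rough illustration of the comparison of termini separations, the following sketch treats the N termini around an N-fold axis as points on a ring of radius r, so that adjacent termini are separated by 2 r sin(pi/N). This ring model, the function names and the radii used are assumptions made for illustration only; they are not measured values and are not taken from the specification.

```python
import math


def ring_separation(radius_nm, n):
    """Distance between adjacent termini for n termini on a ring of given radius."""
    if n < 2:
        raise ValueError("an N-fold axis requires n >= 2")
    return 2.0 * radius_nm * math.sin(math.pi / n)


def separation_mismatch(radius_first_nm, radius_further_nm, n):
    """Absolute difference in adjacent-termini separation between two assemblies."""
    return abs(ring_separation(radius_first_nm, n)
               - ring_separation(radius_further_nm, n))


# Hypothetical radii (in nm) for the termini rings of the first and further
# oligomer assemblies around a shared 4-fold axis.
mismatch = separation_mismatch(2.0, 1.5, 4)
print(round(mismatch, 3))  # 0.707
```

A small mismatch of this kind is what would have to be accommodated by deformation of the monomers or by a linking group.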

Considering the deformation of the monomers near the N-fold fusion mentioned 
above, it is desirable to minimise such deformation, which will tend to reduce the reliability
of the assembly process. However, if a linking group is fused between the monomers, such
deformation may be taken up, at least partially, by the linking group itself. This reduces
the deformation of the monomers, thereby increasing the reliability of self-assembly,
because the linking group is not part of the naturally occurring protein and so does not
take part in the assembly process. This is a particular advantage of the use of a
linking group.

Furthermore, the linking group may be specifically designed to be oriented 
relative to the first and further monomers in the protomer in its normal form, prior to 
assembly, to reduce such differences in the position and/or orientation of the termini of the 
first and further monomers. Using the position and orientation of the termini of the first and
further monomers in the first and further oligomer assemblies in their natural form, information which is
generally available for oligomer assemblies, as discussed above, it is possible to design an
appropriate linking group using conventional modelling techniques.

Typically, the monomers are fused at their termini. Alternatively, the
monomers may be fused at an alternative location in the polypeptide chain so long as the 
native fold and symmetry of the naturally occurring oligomer assembly remains the same. 
For example, one of the monomers may be inserted into a structurally tolerant portion of
the other monomer, for example in a loop extending out of the oligomer assembly. Also, 
truncation of a monomer is feasible and may be estimated by structural examination. 

Some examples of symmetries for the oligomer assemblies to produce a protein 
lattice are as follows. 

In the examples, the first oligomer assembly which is symmetrical in three 
dimensions belongs to one of a tetrahedral point group, an octahedral point group or a 
dihedral point group. 

In some classes of protein lattice, the protomers are homologous with respect to 
the monomers, i.e. there is a single type of protomer within the protein lattice. For example,
Table 1 represents some simple homologous protomers capable of forming a protein 
lattice. 



Protomer    Class Name    M             N
p3p3        Platonic      12            3
p4p4        Platonic      24            4
p4p3        Platonic      24 (or 12)    3
p3d3        Mixed         12            3
p3d2        Mixed         12            2
p4d4        Mixed         24            4
p4d3        Mixed         24            3
p4d2        Mixed         24            2
d3d3d2      Dihedral      6             3, 2
d4d4d2      Dihedral      8             4, 2
d6d6d2      Dihedral      12            6, 2

Table 1 - Homologous Protomers



In Table 1, each protomer is identified by letters which represent the respective 
monomers of the protomer. In particular the letters identify the point group to which the 
oligomer assembly of that monomer belongs. For each letter, the number following it
represents the order of the point group. The letter p represents a platonic point group, so p3
represents a tetrahedral point group, and p4 represents an octahedral point group. The letter
d represents a dihedral point group. 

In the final two columns of the table, there is given the number M of first 
monomers in the first oligomer assembly and the order(s) N of the set of rotational 
symmetry axes of the first oligomer assembly. N is also the order of the rotational 
symmetry axis of the further oligomer assembly aligned with a respective rotational
symmetry axis of the first oligomer assembly, and around which there is formed an N-fold 
fusion between the first and further oligomer assemblies. 
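
The M column of Table 1 can be checked against the order of the rotation group of the first oligomer assembly: 12 for a tetrahedral group, 24 for an octahedral group and 2n for a dihedral group of order n. The following sketch assumes one monomer per symmetry operation (a homo-oligomer) and assumes that each first monomer contributes one fused terminus to a given N-fold fusion, so that the number of N-fold fusion sites on one first oligomer assembly is M/N. The function names and labels are hypothetical.

```python
def monomer_count(group):
    """Number M of monomers in a homo-oligomer with the given point group."""
    if group == "T":
        return 12
    if group == "O":
        return 24
    if group.startswith("D") and group[1:].isdigit():
        return 2 * int(group[1:])
    raise ValueError("unsupported point group: " + group)


def fusion_sites(group, n):
    """Number of N-fold fusion sites on one first oligomer assembly (M / N)."""
    m = monomer_count(group)
    if m % n != 0:
        raise ValueError("N must divide M")
    return m // n


# (first point group, N) pairs taken from rows of Table 1.
for group, n in [("T", 3), ("O", 4), ("O", 3), ("O", 2), ("D3", 3)]:
    print(group, n, monomer_count(group), fusion_sites(group, n))
# T 3 12 4
# O 4 24 6
# O 3 24 8
# O 2 24 12
# D3 3 6 2
```

For an octahedral first assembly with N = 4, the six sites correspond to the two ends of each of the three 4-fold axes.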

The protomers have been divided into classes which have been named according 
to the nature of the monomers of the proteins for ease of reference.

In both the platonic and mixed classes, the first oligomer assembly belongs to a 
platonic point group, which is either a tetrahedral point group or an octahedral point group. 

In the mixed class, the further monomer is a monomer of an oligomer assembly 
belonging to a dihedral point group. In each case, the order N of the dihedral point group, 
which is the order of the principal rotational symmetry axis of the dihedral point group, is
equal to the order of one of the rotational symmetry axes of the first oligomer assembly. 
This may either be the principal rotational symmetry axis of the first oligomer assembly or 
one of the rotational symmetry axes of the first oligomer assembly of lower order. The 
rotational symmetry axes of the first oligomer assembly of order N therefore constitute the 
set of rotational symmetry axes of the first oligomer assembly. The symmetries of the first
and further oligomer assemblies result in the formation of a unit cell in which the
principal rotational symmetry axis of each further oligomer assembly belonging to a
dihedral point group is aligned with one of the set of rotational symmetry axes of order N of
the platonic point group, with an N-fold fusion therebetween, in the manner described
above.

The protein lattices of the mixed class are the easiest to visualise. In particular, 
the first oligomer assembly belonging to a platonic point group may be visualised as a node 
from which the set of rotational symmetry axes of order N extend outwardly. The dihedral 
point groups may be visualised as linear links with the principal rotational symmetry axis 
of the dihedral point group aligned with one of the set of rotational symmetry axes of order
N of the first oligomer assembly. In this way, it is easy to visualise the formation of the 
lattice with pores in the spaces between the oligomer assemblies. 
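
The node-and-link picture can be sketched in code for the particular case of an octahedral node with N = 4, where the links run along three mutually perpendicular directions and the lattice is therefore cubic. The sketch below is a purely topological model on an integer grid, assuming axis-aligned nodes; it does not model real protein dimensions, and the names used are hypothetical.

```python
from itertools import product

# The six outward directions of the three 4-fold axes of an octahedral node,
# assuming the node is aligned with the grid axes.
DIRECTIONS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def build_block(n):
    """Return (nodes, links) for an n x n x n block of nodes.

    nodes: set of integer grid positions, one per octahedral assembly.
    links: set of frozensets, each holding the two nodes joined by one
           dihedral assembly.
    """
    nodes = set(product(range(n), repeat=3))
    links = set()
    for node in nodes:
        for direction in DIRECTIONS:
            neighbour = tuple(a + b for a, b in zip(node, direction))
            if neighbour in nodes:
                links.add(frozenset((node, neighbour)))
    return nodes, links


nodes, links = build_block(2)
print(len(nodes), len(links))  # 8 12
```

In an unbounded lattice each node has six links, each shared with one neighbour, giving three links per node; a finite block has fewer because links leaving the block are not counted.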

Fig. 1 illustrates a particular example of a protein lattice 1 belonging to the 
mixed class, in particular having a protomer 2 represented by p4d4. The first oligomer
assembly 3 is human ferritin heavy chain (HFH) which belongs to an octahedral point
group. The further oligomer assembly 4 is E. coli PurE which belongs to a dihedral D4 point
group of order 4. The protomer 2 comprises a first monomer 5 of the first oligomer
assembly 3 and a further monomer 6 of the further oligomer assembly 4 fused together.
On assembly, the protomers 2 form a lattice 1 in which the repeating unit (which is also a 
unit cell) may be taken as, for example, one of the first oligomer assemblies 3, together
with half of each of the adjacent further oligomer assemblies 4 formed by the further
monomers 6 fused to the first monomers 5 of that first oligomer assembly 3. Clearly
visible from Fig. 1 is the symmetry of the protein lattice 1 based on the symmetries of the 
first oligomer assembly 3 and the further oligomer assembly 4. In particular, as the
rotational symmetry axes of order 4 of the further oligomer assembly 4 are aligned with the
set of rotational symmetry axes of order 4 of the first oligomer assembly 3, the symmetry of
the lattice is the same as the symmetry of the set of rotational symmetry axes.

In the platonic class, the further oligomer assembly belongs to a platonic point 
group as well as the first oligomer assembly. 

In the first two protein lattices, where the oligomer assemblies belong to platonic point
groups of the same order, the first and further oligomer assemblies may be identical, in 
which case the first and further monomers are also identical, or may be different oligomer 
assemblies belonging to an identical point group. The set of rotational symmetry axes of 
order N around which is formed an N-fold fusion are the principal rotational symmetry 
axes of the two oligomer assemblies. 

In the third protein lattice in the platonic class where the first and further 
oligomer assemblies belong respectively to tetrahedral and octahedral point groups (or vice 
versa), the rotational symmetry axes of order N around which the N-fold fusion occurs are 
the rotational symmetry axes of order 3 of the two oligomer assemblies. In this case, either 
one of the oligomer assemblies may be considered as the first oligomer assembly. If the 
oligomer assembly belonging to a tetrahedral point group is considered as the first 
oligomer assembly, then the set of rotational symmetry axes are the principal rotational 
symmetry axes. If the oligomer assembly belonging to an octahedral point group is 
considered as the first oligomer assembly, then the set of rotational symmetry axes are the 
set of rotational symmetry axes of order 3, because this is the order of the rotational 
symmetry axes of the further oligomer assembly belonging to the tetrahedral point group. 

The platonic class may be visualised by considering each oligomer assembly as a
node from which the set of rotational symmetry axes of order N extend outwardly, each
axis being joined to a rotational symmetry axis of an oligomer assembly of the opposite type.

Lastly, in the dihedral class, the protomers comprise three monomers all
belonging to a dihedral point group. The central monomer may be considered as the first
monomer of a first oligomer assembly belonging to a dihedral point group of order 3, 4 or
6. The monomers fused to each terminus of the first monomer may each be
considered as the further monomers. One of the further monomers is a monomer of a
further oligomer assembly belonging to a dihedral point group of the same order as the
dihedral point group of the first oligomer assembly. Thus, the symmetries of
the first oligomer assembly and this one of the further oligomer assemblies result in
the formation of a repeating unit in which the principal rotational symmetry axes of both
oligomer assemblies (i.e. the rotational symmetry axis of the same order as the dihedral
point group) are aligned. Therefore, in the protein lattice, these oligomer assemblies are
arranged in columns along which the first and further oligomer assemblies are alternately
arranged.

The other of the further monomers is a monomer of an oligomer assembly
belonging to a dihedral point group of order 2 and so has a rotational symmetry axis of
order 2, equal in order to the rotational symmetry axes of order 2 of the first oligomer
assembly. Such rotational symmetry axes of the first oligomer assembly are equal in
number to the order of the dihedral point group to which the first oligomer assembly
belongs, and extend perpendicular to the principal rotational symmetry axis of the dihedral
point group, being arranged symmetrically around that principal rotational symmetry axis.
Therefore, the further oligomer assemblies belonging to a dihedral point group of order 2
are arranged in the assembled protein lattice with their principal rotational symmetry axes
aligned to the just described rotational symmetry axes of order 2 of the first oligomer
assembly. As these extend perpendicular to the principal rotational symmetry axes of the
first oligomer assembly, the further oligomer assemblies belonging to a dihedral point
group of order 2 may be considered as links between the columns of oligomer assemblies
described above.

In other words, the set of rotational symmetry axes of the first oligomer assembly
includes the principal rotational symmetry axis of order 3, 4 or 6, together with the
rotational symmetry axes of order 2 perpendicular to the principal rotational symmetry
axis.
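
The arrangement of axes just described for a dihedral point group can be checked numerically. The following Python sketch is an illustration only: the function name dihedral_axes and the choice of the z axis as the principal axis are assumptions made here and do not come from the specification. For a dihedral point group of order n it lists the principal axis and the n axes of order 2, and confirms that the latter are perpendicular to the former.

```python
import math

def dihedral_axes(n):
    """Return (principal_axis, twofold_axes) for a dihedral point group of order n.

    Illustrative convention (an assumption): the principal n-fold axis lies
    along z, and the n two-fold axes lie in the xy plane, pi/n apart.
    """
    principal = (0.0, 0.0, 1.0)
    twofold = [
        (math.cos(k * math.pi / n), math.sin(k * math.pi / n), 0.0)
        for k in range(n)
    ]
    return principal, twofold

def dot(u, v):
    """Scalar product of two 3-vectors."""
    return sum(a * b for a, b in zip(u, v))

for order in (3, 4, 6):
    principal, twofold = dihedral_axes(order)
    # The number of two-fold axes equals the order of the dihedral group.
    assert len(twofold) == order
    # Every two-fold axis is perpendicular to the principal axis.
    assert all(abs(dot(principal, axis)) < 1e-12 for axis in twofold)
    print(order, [tuple(round(c, 3) for c in axis) for axis in twofold])
```

In this picture, the further oligomer assemblies of order 2 would sit along the listed in-plane directions, linking neighbouring columns.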

In other classes of protein lattice, the protomers are heterologous with respect to
the monomers, i.e. there are two or more types of protomer in the protein lattice. To
achieve assembly of any two types of protomer, the two types of protomer include different
monomers of the same heterologous oligomer assembly. Thus when the protomers of the
different types are allowed to assemble, the heterologous oligomer assemblies assemble,
thereby linking the protomers of the two types. However, in contrast to homologous
protomers, a single type of protomer cannot by itself assemble into the entire protein
lattice. The individual monomers of the heterologous oligomer assembly cannot
self-assemble into the entire heterologous oligomer assembly in the absence of the other,
different monomers of that heterologous assembly. This provides advantages during
manufacture of the protein lattices, because each type of protomer may be separately
produced without assembly of an entire protein lattice which might otherwise disrupt the
production of the protomer. This allows production in a two-stage process, which will be
described in more detail below.

Preferably, the heterologous oligomer assembly belongs to a cyclic point group.
In this case, the heterologous oligomer assembly may constitute a further oligomer
assembly which is fused in the assembled lattice by an N-fold fusion to the first oligomer
assembly.

In the simplest types of protein lattice, the heterologous protomers each further 
comprise a monomer of a homologous oligomer assembly, which may be the first oligomer 
assembly. The individual types of protomer may assemble into a respective, discrete 
component of the unit cell, as a result of the monomers of the homologous oligomer 
assembly self-assembling. This is an advantage of the heterologous protomers, because 
assembly of the lattice may be avoided until the components are brought together. 
Otherwise assembly of the lattice might hinder the production of the protomers themselves. 

For example, Table 2 represents some simple heterologous protomers capable of 
forming a protein lattice. 



Protomers                   Components   Class      1st Protomer    2nd Protomer
                                                    M      N        M      N
p3c3A + p3c3A*              P3/P3        Platonic   12     3        12     3
p4c[illegible]A + p3c3A*    P4/P4        Platonic   24     3        12     3
p4c3A + p3c3A*              P4/P3        Platonic   24     3        12     3
p3c3A + d3c3A*              P3/D3        Mixed      12     3
p3c2A + d2c2A*              P3/D2        Mixed      12     2
p4c4A + d4c4A*              P4/D4        Mixed      24     4
p4c3A + d3c3A*              P4/D3        Mixed      24     3
p4c2A + d2c2A*              P4/D2        Mixed      24     2
c3Ad3d2 + c3A*d3d2          D3/D3        Dihedral   6      3,2      6      3,2
c4Ad4d2 + c4A*d4d2          D4/D4        Dihedral   8      4,2      8      4,2
c6Ad6d2 + c6A*d6d2          D6/D6        Dihedral   12     6,2      12     6,2
d3d3c2A + d3d3c2A*          D3/D3        Dihedral   6      3,2      6      3,2
d4d4c2A + d4d4c2A*          D4/D4        Dihedral   8      4,2      8      4,2
d6d6c2A + d6d6c2A*          D6/D6        Dihedral   12     6,2      12     6,2
c3Ad3c2B + c3A*d3c2B*       D3/D3        Dihedral   6      3,2      6      3,2
c4Ad4c2B + c4A*d4c2B*       D4/D4        Dihedral   8      4,2      8      4,2
c6Ad6c2B + c6A*d6c2B*       D6/D6        Dihedral   12     6,2      12     6,2

Table 2 - Heterologous Protomers

In Table 2, monomers of a single heterologous oligomer assembly belonging to a 
cyclic point group are used so that the protein lattice is formed from two types of protomer 
identified in the first column. Each of the protomers includes one of the monomers of the 
heterologous oligomer assembly. 

In Table 2, the monomers of each protomer are identified by lower case letters in
similar manner as in Table 1. The lower case letters p and d have the same meaning as in
Table 1. In addition, lower case c represents a monomer of a heterologous oligomer
assembly belonging to a cyclic point group. The subscript number again represents the
order of the point group. The subscript capital letters A and A* are used to identify the two
different monomers of the same heterologous assembly.
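
The notation can be read mechanically. The Python sketch below is an illustration only: it assumes the subscripts are written inline (for example c3A*d3d2), and the names MONOMER, CLASS_NAMES and parse_protomer are hypothetical helpers introduced here. It splits a protomer string into its monomers, giving for each the kind of point group, the order, and any label identifying a monomer of a heterologous assembly.

```python
import re

# One monomer: a class letter, the order of its point group, and an optional
# capital-letter label (with optional asterisk) for heterologous monomers.
MONOMER = re.compile(r"([pdc])(\d+)([A-Z]\*?)?")

CLASS_NAMES = {"p": "platonic", "d": "dihedral", "c": "cyclic (heterologous)"}

def parse_protomer(notation):
    """Split a protomer string such as 'c3A*d3d2' into its monomers."""
    monomers = []
    pos = 0
    while pos < len(notation):
        match = MONOMER.match(notation, pos)
        if match is None:
            raise ValueError(f"unrecognised notation at position {pos}: {notation!r}")
        letter, order, label = match.groups()
        monomers.append((CLASS_NAMES[letter], int(order), label or ""))
        pos = match.end()
    return monomers

for text in ("p3c3A", "d3c3A*", "c3A*d3d2"):
    print(text, parse_protomer(text))
```

Applied to the two protomers of Fig. 2, this would give a platonic monomer of order 3 fused to a cyclic monomer A of order 3, and a dihedral monomer of order 3 fused to the complementary cyclic monomer A*.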

In Table 2, the second column identifies the point groups to which the
components resulting from the assembly of each type of protomer belong. A similar
notation is used as for the monomers of the protomer, except that capital letters are used to
indicate that the point group of the component is being referred to. Thus capital letter P
indicates that the component belongs to a platonic point group, so P3 represents a
tetrahedral point group and P4 represents an octahedral point group. Capital letter D
indicates that the component belongs to a dihedral point group. In a similar manner to
Table 1, the final columns give, in respect of each protomer where appropriate, the number
M of monomers in the first oligomer assembly and the order(s) N of the set of rotational
symmetry axes of the first oligomer assembly which are aligned with the rotational
symmetry axis of a further oligomer assembly.

For ease of reference, the protein lattices are divided into classes on the basis of 
the symmetry of their components, in a similar manner to the division of the protein 
lattices formed from homologous protomers. In each case, the heterologous protomers 
may be derived from the protomers of the corresponding class of homologous protomer in 
Table 1. 
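
The values of M listed in Table 2 are consistent with the orders of the corresponding rotational point groups. The Python sketch below is an illustration only: the function name rotation_group_order is hypothetical, and it assumes that each symmetry-related position of the assembly is occupied by exactly one monomer.

```python
def rotation_group_order(kind, n=None):
    """Number of rotations in a point group, taken here as the monomer count M.

    Assumption: one monomer per symmetry-related position of the assembly.
    """
    if kind == "tetrahedral":   # P3 in the notation of the tables
        return 12
    if kind == "octahedral":    # P4 in the notation of the tables
        return 24
    if kind == "dihedral":      # Dn has 2n rotations
        return 2 * n
    if kind == "cyclic":        # Cn has n rotations
        return n
    raise ValueError(f"unknown point group kind: {kind!r}")

print("P3:", rotation_group_order("tetrahedral"))
print("P4:", rotation_group_order("octahedral"))
for n in (3, 4, 6):
    print(f"D{n}:", rotation_group_order("dihedral", n))
```

These counts agree with the M entries of 12 and 24 for the platonic components and 6, 8 and 12 for the dihedral components of order 3, 4 and 6.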

For the mixed class and the platonic class, the two types of protomers both
comprise:

(a) a monomer of a homologous oligomer assembly which belongs to the same point
group as a respective one of the monomers of the corresponding homologous
protomer; and

(b) a monomer which is a respective one of the two different monomers of the
heterologous oligomer assembly which belongs to a cyclic point group.

The order of the cyclic point group to which the heterologous oligomer assembly
belongs is the same as the order N of the N-fold fusion between the oligomer assemblies of
the protein lattice formed from the corresponding homologous protomer, that is the order
of the respective rotational symmetry axis of the first oligomer assembly.

Thus, in the assembled protein lattice, the repeating unit has fundamentally the
same arrangement as the repeating unit of the corresponding homologous protomer, except
as follows. Instead of the N-fold fusion between the two homologous oligomer assemblies
of the homologous protomer, the link between the homologous oligomer assemblies is
extended by the insertion of the heterologous oligomer assembly. Therefore, it will be seen
that the heterologous oligomer assembly effectively extends the length
of the links of the repeating unit between the first oligomer assemblies, which may be
considered as nodes in the protein lattice. Thus, the size of the pores within the protein
lattice is also increased relative to the use of the corresponding homologous protomers.
Increasing the size of the pores in this manner represents a significant advantage of the use
of heterologous protomers.

Fig. 2 illustrates a particular example of a protein lattice 7 belonging to the
mixed class, in particular having respective protomers 8 and 9 represented by p3c3A and
d3c3A*, respectively. The first protomer 8 comprises a first monomer 10 of a first
homologous oligomer assembly 11, namely E. coli Dps, which belongs to a tetrahedral
point group. Fused to the first monomer 10 in the first protomer 8 is a further monomer 12
of a further heterologous oligomer assembly 13, namely bacteriophage T4 gp5 and gp27,
which belongs to a cyclic point group of order 3. On assembly, the first protomer 8 forms
a first component 14 by the first monomers 10 assembling together. The first component
14 has the same symmetry as the first oligomer assembly 11 of the first protomer 8.

The second protomer 9 comprises a monomer 15 which is the other monomer of
the further oligomer assembly 13 of the first protomer 8 and which is heterologous to the further
monomer 12 of the first protomer 8. The second protomer 9 also comprises a monomer 16
which is a monomer of a homologous oligomer assembly 17, namely human PTPS, which
belongs to a dihedral D3 point group of order 3. On assembly, the second protomer 9 forms
a second component 18 by the homologous monomers 16 assembling together.

When the first and second components 14 and 18 are brought together, they
assemble to form the protein lattice 7 by assembly of the heterologous oligomer assembly
13. It is clearly visible from Fig. 2 how the symmetry of the protein lattice 7 is based on
the symmetries of the homologous oligomer assemblies 11 and 17. In particular, the
rotational symmetry axes of order 3 of both the heterologous oligomer assembly 13 and the
homologous oligomer assembly 17 of the second protomer 9 are aligned with the set of
rotational symmetry axes of order 3 of the first oligomer assembly 11 of the first protomer
8. It is further clear from Fig. 2 how the heterologous oligomer assemblies 13 effectively
extend the length of the links between the first oligomer assemblies 11. In the lattice 7, the
repeating unit may be taken, for example, as one of the first components 14 and half of
each of the adjacent second components 18. In this case, the unit cell is formed by a
number of such repeating units combined together.

The heterologous protomers of the dihedral class comprise three monomers and
may be derived from a corresponding one of the dihedral class of homologous
protomers. In particular, the two types of protomer comprise
the corresponding homologous protomer with either one (or both) of the further monomers
of the corresponding homologous protomer replaced by respective monomers of a
heterologous oligomer assembly belonging to a cyclic point group of the same order as the
dihedral point group to which the oligomer assembly of the replaced monomer belongs.

The above examples of protein lattices are believed to represent the simplest
form of protomers capable of forming a protein lattice and are preferred for that reason.
However, it will be appreciated that other protomers formed from monomers of oligomer
assemblies having suitable symmetries will be capable of forming a protein lattice. For
example, other homologous protomers having larger numbers of monomers than listed in
Table 1 will be capable of forming a protein lattice. Similarly, other heterologous
protomers will be capable of forming a protein lattice. These may include two types of
protomer having larger numbers of monomers than in the examples of Table 2, or may
include more than two types of protomer.

For each of the monomers, there is a large choice of oligomer assemblies having 
the required symmetry. The present invention is not limited to particular oligomer 
assemblies, because in principle any oligomer assembly having a quaternary structure with 
the requisite symmetry may be used. However, as examples Table 3 lists some possible 
choices of oligomer assembly for each of the point groups of Tables 1 and 2. 



Point group     Source               Name of Oligomer Assembly               PDB Code
P3 (T, 23)      E. coli              Dps
                S. epidermidis       EpiD                                    1G63
P4 (O, 432)     Human                Heavy chain ferritin                    2FHA
                E. coli              Dihydrolipoamide succinyltransferase    1E20
                A. vinelandii        Dihydrolipoamide acetyltransferase      1EAB
D2              Human                Mn superoxide dismutase                 1AP5
                P. falciparum        Lactate dehydrogenase                   1CEQ
D3              Rat                  6-pyruvoyl tetrahydropterin synthase    1B66
                E. coli              Amino acid aminotransferase             1I1L
D4              E. coli              PurE                                    1QCZ
                Sipunculid worm      Hemerythrin                             2HMQ
D6              S. typhimurium       Glutamine synthetase                    1F1H
                Human                Casein kinase alpha and beta chains     1JWH
                Coliphage T4         gp5 + gp27                              1K28
                HIV                  N36 + C34                               1AIK
                Pseudomonas putida   Naphthalene 1,2-dioxygenase             1NDO
c4A + c4A*      Brachiopod           Hemerythrin                             N/A

Table 3 - Example oligomer assemblies

Thus the present invention provides a protein protomer or plural protein
protomers capable of assembly into a protein lattice. The monomers of the protomer may
be of any length but typically have a length of 5 to 1000 amino acids, preferably at least 20
amino acids and/or preferably at most 500 amino acids.

The invention also provides polynucleotides which encode the protein protomers
of the invention. The polynucleotide will typically also comprise an additional sequence
beyond the 5' and/or 3' ends of the coding sequence. The polynucleotide typically has a
length of at least three times the length of the encoded protomer. The polynucleotide may
be RNA or DNA, including genomic DNA, synthetic DNA or cDNA. The polynucleotide
may be single or double stranded.

The polynucleotides may comprise synthetic or modified nucleotides, such as 
methylphosphonate and phosphorothioate backbones or the addition of acridine or 
polylysine chains at the 3' and/or 5' ends of the molecule. 

Such polynucleotides may be produced and used using standard techniques. For
example, the comments made in WO00/68248 about nucleic acids and their uses apply
equally to the polynucleotides of the present invention.

The monomers are typically combined to form protomers by fusion of the
respective genes at the genetic level (e.g. by removing the stop codon of the 5' gene and
allowing an in-frame read through to the 3' gene). In this case the recombinant gene is
expressed as a single polypeptide. The genes may, alternatively, be fused at a position
other than a terminus so long as the quaternary structure of the oligomer assembly
remains substantially unaffected. In particular, one gene may be inserted within
a structurally tolerant region of a second gene to produce an in-frame fusion.

Chemical fusion of the polypeptide chains may be used as an alternative to
fusion at the genetic level. In this instance the polypeptides are fused post-translationally
by means of a covalent linkage, in particular through the use of intein chemistry.
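
The end-to-end genetic fusion described above can be illustrated with a short Python sketch. It is a minimal illustration under stated assumptions and not the construct actually used: the function name fuse_in_frame and the toy sequences are hypothetical, no linker is included, and both inputs are assumed to be complete coding sequences given 5' to 3'.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}

def fuse_in_frame(gene_5prime, gene_3prime):
    """Join two coding sequences so that they are read as one polypeptide.

    The stop codon of the 5' gene is removed; the 3' gene keeps its own stop.
    """
    five = gene_5prime.upper()
    three = gene_3prime.upper()
    if len(five) % 3 or len(three) % 3:
        raise ValueError("each coding sequence must be a whole number of codons")
    if five[-3:] in STOP_CODONS:
        five = five[:-3]
    fused = five + three
    # Check that no stop codon remains in frame before the final codon.
    internal = [fused[i:i + 3] for i in range(0, len(fused) - 3, 3)]
    if any(codon in STOP_CODONS for codon in internal):
        raise ValueError("in-frame stop codon inside the fusion")
    return fused

# Toy sequences (hypothetical, not real genes).
print(fuse_in_frame("ATGGCTTAA", "ATGAAATGA"))  # prints ATGGCTATGAAATGA
```

Because both inputs are whole numbers of codons, the reading frame of the 3' gene is preserved after the stop codon of the 5' gene is removed.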

The invention also provides expression vectors which comprise polynucleotides
of the invention and which are capable of expressing a protein protomer of the invention.
Such vectors may also comprise appropriate initiators, promoters, enhancers and other
elements, such as for example polyadenylation signals which may be necessary, and which
are positioned in the correct orientation, in order to allow for protein expression.

Thus the coding sequence in the vector is operably linked to such elements so
that they provide for expression of the coding sequence (typically in a cell). The term
"operably linked" refers to a juxtaposition wherein the components described are in a
relationship permitting them to function in their intended manner.

The vector may be, for example, a plasmid, virus or phage vector. Typically the
vector has an origin of replication. The vector may comprise one or more selectable
marker genes, for example an ampicillin resistance gene in the case of a bacterial plasmid
or a resistance gene for a fungal vector.

Promoters and other expression regulation signals may be selected to be
compatible with the host cell for which expression is designed. For example, yeast
promoters include S. cerevisiae GAL4 and ADH promoters, and S. pombe nmt1 and adh
promoters. Mammalian promoters include the metallothionein promoter which can be
induced in response to heavy metals such as cadmium. Viral promoters such as the SV40
large T antigen promoter or adenovirus promoters may also be used.

Mammalian promoters, such as β-actin promoters, may be used. Tissue-specific
promoters are especially preferred. Viral promoters may also be used, for example the
Moloney murine leukaemia virus long terminal repeat (MMLV LTR), the Rous sarcoma
virus (RSV) LTR promoter, the SV40 promoter, the human cytomegalovirus (CMV) IE
promoter, adenovirus, HSV promoters (such as the HSV IE promoters), or HPV promoters,
particularly the HPV upstream regulatory region (URR).

Another method that can be used for the expression of the protein protomers is 
cell-free expression, for example bacterial, yeast or mammalian. 

The invention also includes cells that have been modified to express the
protomers of the invention. Such cells include transient or, preferably, stable higher
eukaryotic cell lines, such as mammalian cells or insect cells (using, for example, a
baculovirus expression system), lower eukaryotic cells such as yeast, and prokaryotic cells
such as bacterial cells. Particular examples of cells which may be modified by insertion of
vectors encoding a polypeptide according to the invention include mammalian
HEK293T, CHO, HeLa and COS cells. Preferably the cell line selected will be one which
is not only stable, but also allows for mature glycosylation of a polypeptide. Expression
may be achieved in transformed oocytes.

The protein protomers, polynucleotides, vectors or cells of the invention may be
present in a substantially isolated form. They may also be in a substantially purified form,
in which case they will generally comprise at least 90%, e.g. at least 95%, 98% or 99%, of
the proteins, polynucleotides, cells or dry mass of the preparation.

The protomers may be prepared using the vectors and host cells using standard
techniques. For example, the comments made in WO-00/68248 regarding methods of
preparing protomers (referred to as "fusion proteins" in WO-00/68248) apply equally to
preparation of protomers according to the present invention.

Assembly of the protein lattice from the protomers may be performed simply by
placing the protomers under suitable conditions for self-assembly of the monomers of the
oligomer assemblies. Typically, this will be performed by placing the protomers in
solution, preferably an aqueous solution. Typically, the suitable conditions will correspond
to those in which the naturally occurring protein self-assembles in nature. Suitable
conditions may be those specifically disclosed in WO-00/68248.

In the case of homologous protomers this results in direct assembly of the protein
lattice.

In the case of heterologous protomers, assembly is preferably performed in plural
stages. In a first stage, each type of protomer is separately assembled into a respective
discrete component. In a second stage, the discrete components are brought together and
assembled into the protein lattice. Where plural heterologous protomers are used, there
may be further stages intermediate the first and second stage in which the respective
discrete components are brought together and assembled into larger, intermediate
components.

A specific protein lattice of the type illustrated in Figure 1 has been prepared using
the following method.

Human ferritin heavy chain (HFH) and the E. coli PurE gene were amplified by
PCR from human cDNA and E. coli gDNA respectively. Primers for amplification of the
ferritin gene were: 5'-CCT TAG TCG AAT TCA TGA CGA CCG CGT CCA CC-3' and
5'-GGG AAA TTA GCC CTC GAG TTA GCT TTC ATT ATC-3'. Primers for
amplification of the PurE gene were: 5'-GTT TTA AGA CCC ATG GCT TCC CGC AAT
AAT CCG-3' and 5'-CGC AAA CCT GGA TCC TGC CGC ACC TCG CGG-3'. The
PurE gene was cloned into the pET-28b vector (Novagen) between the NcoI and BamHI
sites. The HFH gene was cloned into the resulting vector between the EcoRI and XhoI sites
to create an in-frame fusion of the two genes under control of the T7lac promoter.
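
As a cross-check of the cloning strategy, each quoted primer can be searched for the recognition sequence of one of the restriction enzymes named above. The Python sketch below is an illustration only: the pairing of each primer with a particular enzyme and the labels used for the primers are assumptions made here, inferred from the cloning sites stated in the text.

```python
# Recognition sequences of the four enzymes named in the text.
SITES = {"EcoRI": "GAATTC", "XhoI": "CTCGAG", "NcoI": "CCATGG", "BamHI": "GGATCC"}

# Primers as quoted in the text, with spaces removed. The labels and the
# enzyme assigned to each primer are assumptions made for this illustration.
PRIMERS = {
    "HFH primer 1": ("CCTTAGTCGAATTCATGACGACCGCGTCCACC", "EcoRI"),
    "HFH primer 2": ("GGGAAATTAGCCCTCGAGTTAGCTTTCATTATC", "XhoI"),
    "PurE primer 1": ("GTTTTAAGACCCATGGCTTCCCGCAATAATCCG", "NcoI"),
    "PurE primer 2": ("CGCAAACCTGGATCCTGCCGCACCTCGCGG", "BamHI"),
}

def site_position(primer, enzyme):
    """Return the 0-based index of the enzyme's site in the primer, or -1."""
    return primer.find(SITES[enzyme])

for name, (sequence, enzyme) in PRIMERS.items():
    position = site_position(sequence, enzyme)
    status = f"found at index {position}" if position >= 0 else "not found"
    print(f"{name}: {enzyme} site {SITES[enzyme]} {status}")
```

Each of the four recognition sequences is palindromic, so finding it on the quoted strand is sufficient.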

This vector was transformed into E. coli strain B834(pLysS) for expression.
Induction of expression was as follows: a 10ml overnight culture of the expression strain
(in LB broth containing 30 µg/ml kanamycin) was diluted 1:100 into fresh LB broth
containing 30 µg/ml kanamycin. Cells were grown with shaking at 37°C to a density
corresponding to an OD600 of 0.6 and were then induced to express the target protein by the
addition of IPTG to a final concentration of 1mM. The culture was maintained at 37°C with
shaking for a further 3 hours before the cells were harvested by centrifugation (5000g,
10 min, 4°C). The cell pellet was resuspended in 20ml of buffer A (300mM NaCl, 1mM
EDTA, 50mM HEPES, pH7.5). Cells were lysed by sonication and the insoluble fraction
harvested by centrifugation (25,000g, 30 min, 4°C). This fraction was dissolved in 8M urea
and centrifuged (25,000g, 30 min, 4°C) to remove insoluble particles. The urea solubilised
material was concentrated to 16mg/ml and passed through a 0.22 µm filter. A drop of this
material (1 µl) was then directly injected into a larger drop (5 µl) of buffer A. Protein lattice
particles were observed within one hour. Fig. 3 is a picture of one of the protein lattice
particles having a diameter of approximately 0.6mm. The elemental composition of the
protein lattice has been confirmed using µPIXE techniques.
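
The effect of injecting the 1 µl drop into the 5 µl drop of buffer A can be estimated with simple dilution arithmetic. The Python sketch below is a rough illustration only: it assumes complete mixing, assumes the concentrated material is still in 8M urea, and assumes buffer A contains neither urea nor protein. The function name after_mixing is hypothetical.

```python
def after_mixing(stock_conc, stock_volume_ul, buffer_volume_ul):
    """Concentration after a small volume of stock is fully mixed into buffer."""
    return stock_conc * stock_volume_ul / (stock_volume_ul + buffer_volume_ul)

# 1 ul of 16 mg/ml protein in 8 M urea, injected into 5 ul of buffer A.
protein_mg_per_ml = after_mixing(16.0, 1.0, 5.0)
urea_molar = after_mixing(8.0, 1.0, 5.0)

print(round(protein_mg_per_ml, 2), "mg/ml protein")  # 2.67
print(round(urea_molar, 2), "M urea")                # 1.33
```

Under these assumptions the urea concentration falls roughly six-fold on mixing, which would be expected to favour refolding and self-assembly of the protomers.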

Protein lattices in accordance with the present invention have numerous different
uses. In general, such uses will take advantage of the regular repeating structure and the
pores within the lattice. Lattices in accordance with the present invention may be designed
to have pores with dimensions expected to be of the order of nanometres to hundreds of
nanometres. Lattices may be designed with an appropriate pore size for a desired use.

The highly defined, unusually sized and finely controlled pore sizes of the protein
lattices together with the stability of their lattice structures make them ideal for
applications requiring microporous materials with pore sizes in the range just mentioned.
As one example, the lattices are expected to be useful as a filter element or molecular sieve
for filtration or separation processes. In this use, the pore sizes achievable and the ability
to design a pore's size would be particularly advantageous.

In another class of use, macromolecular entities would be attached to the protein
lattice. Such attachment may be done using conventional techniques. The macromolecular
entities may be any entities of an appropriate size, for example proteins, polynucleotides or
non-biological entities. As such, the protein lattices are expected to be useful as biological
matrices for carrying macromolecular entities, for example for use in drug delivery, or for
crystallizing macromolecular entities.

Attachment of the macromolecular entities to the protein lattice may be performed
by "tagging" either or both of the protein protomers or the macromolecular entities of
interest. In this context, tagging is the covalent addition to either or both of the protein
protomers or the target macromolecular entities, of a structure known as a tag which forms
strong interactions with a target structure. The target structure may be a further tag
attached to the other of the protein protomer or target macromolecular entity, or may be a
part of the protein protomer or target macromolecular entity. In the case of the protein
protomer, or a macromolecular entity which is a protein, this may be achieved by the
expression of a genetically modified version of the protein to carry an additional sequence
of peptide elements which constitute the tag, for example at one of its termini, or in a loop
region. Alternative methods of adding a tag include covalent modification of a protein
after it has been expressed, through techniques such as intein technology.

Thus to attach the macromolecular entity to the protein lattice, the protein
protomers may include, at a predetermined position in the protomers, an affinity tag
which attaches to the macromolecular entity of interest.

Alternatively, the macromolecular entity of interest may carry an affinity tag which
attaches to the protomers at a predetermined position in the protomers.

When a component of the protein lattice is known to form strong interactions with a
known peptide sequence, that peptide sequence may be used as a tag to be added to the
target macromolecular entity. Where no such tight binding partner is known, suitable tags
may be identified by means of screening. The types of screening possible are
phage-display techniques, or redundant chemical library approaches to produce a large number of
different short (for example 3-50 amino acid) peptides. The tightest binding peptide
elements may be identified using standard techniques, for example amplification and
sequencing in the case of phage-displayed libraries or by means of peptide sequencing in
the case of redundant libraries.

To attach the macromolecular entity to the protein lattice using an affinity tag on
the lattice or the macromolecular entity, the macromolecular entity may be allowed to
diffuse into, and hence become attached to, a pre-formed protein lattice, for example by
annealing of the bound macromolecular entity into their lowest energy configurations in
the protein lattice may be performed using controlled cooling in a liquid nitrogen
cryostream. Alternatively, the macromolecular entities may be mixed with the protomers
during formation of the protein lattice to assemble with the lattice.

In another class of uses, proteins having useful properties could be incorporated as
one of the protomers.

A use in which an entity is attached to the protein lattice is to perform X-ray
crystallography of the macromolecular entities. In this case, the regular structure of the
protein lattice allows the macromolecular entities to be held in an array at a predetermined
position relative to a repeating unit, so that they are held in a regular array and in a regular
orientation. X-ray crystallography is important in biochemical research and rational drug
design.

The protein lattice having an array of macromolecular entities supported thereon
may be studied using standard X-ray crystallographic techniques. Use of the protein lattice
as a support in X-ray crystallography is expected to provide numerous and significant
advantages over current technology and protocols for X-ray crystallography, including the
following:

(1) Significantly lower amounts of macromolecule will be required (probably of order 
micrograms rather than milligrams). This will allow determination of some previously 

20 intractable targets. 

(2) Use of affinity tags will allow structure determination without the typical 
requirement for a number of purification steps. 

(3) There will be no need to crystallize the macromolecular entity. This is a difficult 
and occasionally insurmountable step in traditional X-ray structure determination. 

(4) There will be no need to obtain crystalline derivatives for each novel crystal
structure to obtain the required phase information. Since the majority of scattering matter
will be the known protein lattice in each case, determination of the structure may be
automated and achieved rapidly by a computer user with little or no crystallographic
expertise.

(5) The complexes of a protein with chemicals (substrates/drugs) and with other
proteins can be examined without requiring entirely new crystallization conditions.

(6) The process is expected to be extremely rapid and universally applicable, which
will provide enormous savings in time and costs.

For use in catalysing biotransformations, enzymes may be attached to the protein 
lattice, or incorporated in the protein lattice. 

For use in data storage, it may be possible to attach a protein which is optically or
electronically active. One example is bacteriorhodopsin, but many other proteins can be
used in this capacity. In this case, the protein lattice would hold the attached protein in a
highly ordered array, thereby allowing the array to be addressed. The protein lattice is
expected to be able to overcome the size limitations of existing matrices for holding
proteins for use in data storage.

For use in a display, it may be possible to attach a protein which is photoactive or
fluorescent. In this case, the protein lattice would hold the attached protein in a highly
ordered array, thereby allowing the array to be addressed for displaying an image.

For use in charge separation, a protein which is capable of carrying out a charge
separation process may be attached to the protein lattice, or incorporated in the protein
lattice. The protein may then be induced to carry out the separation, for example
biochemically by a "fuel" such as ATP or optically in the case of a photoactive centre such
as chlorophyll or a photoactive protein such as rhodopsin. A variety of charge separation
processes might be performed in this way, for example ion pumping or development of a
photovoltaic charge.

For use as a nanowire, a protein which is capable of electrical conduction may be
attached to the protein lattice, or incorporated in the protein lattice. With an anisotropic
protein lattice, it might be possible to carry current in a particular direction.

For use as a motor, proteins which are capable of induced expansion/contraction
may be incorporated into the protein lattice.

The protein lattices may be used as a mould. For example, silicon could be
diffused or otherwise impregnated into the pores of the protein lattice, thus either partially
or completely filling the lattice interstices. The protein material comprising the original
lattice may, if required, then be removed, for example, through the use of a hydrolysing
solution.

CLAIMS

1. A protein lattice having a regular structure with a repeating unit repeating in three
dimensions,

the repeating unit comprising protein protomers which each comprise at least two
monomers fused together, the monomers each being monomers of a respective oligomer
assembly into which the monomers are assembled for assembly of the protomers into the
lattice,

wherein the repeating unit comprises protomers comprising at least a first monomer
which is a monomer of a first oligomer assembly which is symmetrical in three
dimensions.

2. A protein lattice according to claim 1, wherein the first oligomer assembly has a set
of rotational symmetry axes extending in three dimensions,

whereby said repeating unit includes protomers with the first monomers of the
protomers being assembled into said first oligomer assembly and, in respect of respective
ones of said set of rotational symmetry axes, with further monomers of the protomers fused
to respective first monomers being arranged symmetrically around said respective one of
said set of rotational symmetry axes.

3. A protein lattice according to claim 2, wherein, in said protomers, said further
monomers are monomers of a further oligomer assembly which has a rotational symmetry
axis of the same order as the respective one of said set of rotational symmetry axes of said
first oligomer assembly,

whereby said repeating unit includes said protomers with said further monomers
being assembled into respective further oligomer assemblies with said rotational symmetry
axis of each respective further oligomer assembly being aligned with said respective one of
said set of rotational symmetry axes of said first oligomer assembly.

4. A protein lattice according to claim 1, wherein the first oligomer assembly has a set
of rotational symmetry axes extending in three dimensions, and, in said protomers, further
monomers fused to said first monomers are monomers of respective further oligomer
assemblies which have a rotational symmetry axis of the same order as a respective one of
said set of rotational symmetry axes of said first oligomer assembly,

whereby said repeating unit includes protomers with the first monomers of the
protomers being assembled into said first oligomer assembly and, in respect of respective
ones of said set of rotational symmetry axes, with further monomers of the protomers fused
to respective first monomers being assembled into respective further oligomer assemblies
with said rotational symmetry axis of said respective further oligomer assemblies being
aligned with the respective rotational symmetry axis of said first oligomer assembly.

5. A protein lattice according to any one of claims 2 to 4, wherein the orders of the
rotational symmetry axes of said set of rotational symmetry axes are a respective one of 2,
3, 4 or 6.

6. A protein lattice according to any one of claims 2 to 5, wherein each of said
monomers of said respective oligomer assemblies either is a naturally occurring protein or
is based on a naturally occurring protein with peptide elements being absent from,
substituted in, or added to the naturally occurring protein without substantially affecting
assembly of monomers of said respective oligomer assembly.

7. A protein lattice according to claim 6, wherein, in said protomers, said monomers 
are fused via a linking group. 

8. A protein lattice according to claim 7, wherein the linking group is oriented relative
to the first and further monomers in the protomer in its normal form prior to assembly to
reduce any difference in the assembled lattice in either or both of the position and
orientation of (a) the termini of said first monomers in their arrangement in said first
oligomer assembly in its natural form symmetrically around said respective one of said set
of rotational symmetry axes of said first oligomer assembly, and (b) the termini of said
further monomers in their arrangement in said further oligomer assembly in its natural
form symmetrically around said rotational symmetry axis of said respective further
oligomer assembly.

9. A protein lattice according to any one of claims 3 to 8, wherein the protomers are
homologous with respect to the monomers.

10. A protein lattice according to claim 9, wherein said first oligomer assembly belongs
to either a tetrahedral point group or an octahedral point group.

11. A protein lattice according to claim 10, wherein said further oligomer assembly 
belongs to a dihedral point group of the same order as the respective one of said set of 
rotational symmetry axes of said first oligomer assembly. 

12. A protein lattice according to claim 10, wherein said further oligomer assembly 
belongs to either a tetrahedral point group or an octahedral point group. 

13. A protein lattice according to claim 9, wherein said first oligomer assembly belongs 
to a dihedral point group of order 3, 4 or 6, and said protomers comprise at least two 
further monomers with a further monomer fused to each terminus of said first monomer of 
said first oligomer assembly. 

14. A protein lattice according to claim 13, wherein one of said further monomers is a 
monomer of an oligomer assembly which belongs to a dihedral point group of the same 
order as the dihedral point group to which the first oligomer assembly belongs. 

15. A protein lattice according to claim 14, wherein the other of said further monomers 
is a monomer of an oligomer assembly which belongs to a dihedral point group of order 2. 

16. A protein lattice according to any one of claims 3 to 8, wherein the protomers are
heterologous with respect to the monomers.

17. A protein lattice according to claim 16, wherein the unit cell includes protein
protomers of two types, wherein the two types of protomer include different monomers of
the same heterologous oligomer assembly.

18. A protein lattice according to claim 17, wherein at least a first type of protomer
constitutes said protomers with the first monomers of the protomers being assembled into
said first oligomer assembly and said further monomers of the protomers fused to
respective first monomers are one of said different monomers of the same heterologous
oligomer assembly, said heterologous oligomer assembly belonging to a cyclic point
group.

19. A protein lattice according to claim 18, wherein said first oligomer assembly of the
first type of protomer belongs to either a tetrahedral point group or an octahedral point
group.

20. A protein lattice according to claim 19, wherein the second type of protomer
comprises a monomer which is a monomer of an oligomer assembly belonging to a
dihedral point group of the same order as said heterologous oligomer assembly.

21. A protein lattice according to claim 18, wherein the second type of protomer
comprises a monomer which is a monomer of an oligomer assembly belonging to either a
tetrahedral point group or an octahedral point group.

22. A protein lattice according to any one of the preceding claims having an array of 
macromolecular entities attached thereto. 

23. A protein lattice according to claim 22, wherein the protomers have, at a
predetermined position in the protomers, an affinity tag attached to a macromolecular
entity.

24. A protein lattice according to claim 22 or 23, wherein the macromolecular entities 
have a peptide affinity tag attached to one of the protomers in the protein lattice. 



25. Use of a protein lattice according to any one of claims 1 to 24 as a support for the
array of macromolecular entities for X-ray crystallography of the macromolecular entities.

26. A method of performing X-ray crystallography comprising supporting an array of
macromolecular entities on a protein lattice according to any one of claims 1 to 24 and
performing X-ray crystallography on the lattice having the macromolecular entities
supported thereon.

27. A protein protomer comprising at least two monomers fused together, the
monomers each being monomers of a respective oligomer assembly into which the
monomers are capable of self-assembly to assemble at least part of a repeating unit of a
protein lattice having a regular structure repeating in three dimensions, wherein, in said
protomer, at least a first monomer is a monomer of a first oligomer assembly which is
symmetrical in three dimensions.

28. A protein protomer according to claim 27, wherein the first oligomer assembly has
a set of rotational symmetry axes extending in three dimensions, and, in said protomers,
further monomers fused to the first monomers are monomers of respective further oligomer
assemblies which have a rotational symmetry axis of the same order as a respective one of
said set of the rotational symmetry axes of said first oligomer assembly,

whereby said repeating unit includes protomers with the first monomers of the
protomers being assembled into said first oligomer assembly and, in respect of respective
ones of said set of rotational symmetry axes, with further monomers of the protomers fused
to respective first monomers being assembled into respective further oligomer assemblies
with said rotational symmetry axis of said respective further oligomer assemblies being
aligned with the respective rotational symmetry axis of said first oligomer assembly.

29. Plural different protein protomers according to claim 27 or 28, wherein the 
monomers of the plural different protomers are capable of self-assembly with each other to 
form the entire protein lattice. 

30. A polynucleotide encoding a protein protomer according to claim 27 or 28 or one of
the respective protein protomers of said plural different proteins according to claim 29.

31. A vector capable of expressing a protomer according to claim 27 or 28 or one of the
respective protein protomers of said plural different protein protomers according to claim
29.

32. A host cell comprising a vector according to claim 31.

33. A method of making a protein protomer according to claim 27 or 28 or one of the 
respective protein protomers of said plural different protein protomers according to claim 
29, comprising expressing a polynucleotide sequence which encodes the protomer in a host 
cell and, optionally, purifying the expressed protomer. 



34. A method of making a protein lattice according to any one of claims 1 to 24.
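
Illustrative note, not part of the claims: the orders 2, 3, 4 and 6 recited in claim 5 coincide with the rotation orders permitted by the standard crystallographic restriction, under which an n-fold rotation can map a three-dimensional lattice onto itself only if 2cos(2π/n) is an integer. The minimal sketch below checks this condition numerically. The function name `lattice_compatible_orders` and the search limit of 12 are assumptions made for illustration and do not appear in the specification.

```python
import math


def lattice_compatible_orders(max_order=12, tol=1e-9):
    """Return rotation orders n (1..max_order) for which 2*cos(2*pi/n) is an integer.

    This is the crystallographic restriction: the trace of an n-fold rotation
    written in a lattice basis must be an integer.
    """
    allowed = []
    for n in range(1, max_order + 1):
        t = 2.0 * math.cos(2.0 * math.pi / n)
        if abs(t - round(t)) < tol:
            allowed.append(n)
    return allowed


orders = lattice_compatible_orders()
# Dropping the trivial order 1 leaves the non-trivial lattice-compatible orders.
nontrivial = [n for n in orders if n > 1]
```

The non-trivial orders found this way are 2, 3, 4 and 6, so five-fold and higher-than-six-fold axes are excluded.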







[Drawings not reproduced. Labels recoverable from the drawing sheets of WO 2004/033487 (PCT/GB2003/004306):]

Before the sheet 2/3 header: Homologous; Homologous; Human Heavy Chain; E. coli PurE

Sheet 2/3: Binary Mixed Crysalin P3/D3
Components: P3 and D3
Protomers: p3c3A and d3c3A*
Assemblies: T, C3 and D3
Homologous T: E. coli dps
Bacteriophage T4 gp5 and (label truncated in source)
Homologous D3: Human PTPS

INTERNATIONAL SEARCH REPORT

Application No: PCT/GB 03/04306

A. CLASSIFICATION OF SUBJECT MATTER

IPC 7: C07K1/00, C12N9/18, C07K14/11

According to International Patent Classification (IPC) or to both national classification and IPC

B. FIELDS SEARCHED

Minimum documentation searched (classification system followed by classification symbols):
IPC 7: C07K, C12N

Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched

Electronic data base consulted during the international search (name of data base and, where practical, search terms used):
EPO-Internal, WPI Data, BIOSIS, PAJ

C. DOCUMENTS CONSIDERED TO BE RELEVANT

Citation of document, with indication, where appropriate, of the relevant passages:

WO 00 68248 A (UNIV CALIFORNIA), 16 November 2000 (2000-11-16), cited in the application, the whole document

REZA GHADIRI M ET AL: "SELF-ASSEMBLING ORGANIC NANOTUBES BASED ON A CYCLIC PEPTIDE ARCHITECTURE", NATURE, MACMILLAN JOURNALS LTD., LONDON, GB, vol. 366, 25 November 1993 (1993-11-25), pages 324-327, XP002936460, ISSN: 0028-0836, the whole document

Relevant to claim No.: 1-34

Further documents are listed in the continuation of box C.
Patent family members are listed in annex.


Date of the actual completion of the international search: 19 January 2004

Date of mailing of the international search report: 27/01/2004

Name and mailing address of the ISA:
European Patent Office, P.B. 5818 Patentlaan 2
NL - 2280 HV Rijswijk
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl,
Fax (+31-70) 340-3016

Authorized officer: Keller, Y

Form PCT/ISA/210 (second sheet) (July 1992)



INTERNATIONAL SEARCH REPORT

Application No: PCT/GB 03/04306

C. (Continuation) DOCUMENTS CONSIDERED TO BE RELEVANT

Citation of document, with indication, where appropriate, of the relevant passages:

PLESCHBERGER M ET AL: "Generation of a functional monomolecular protein lattice consisting of an s-layer fusion protein comprising the variable domain of a camel heavy chain antibody", BIOCONJUGATE CHEMISTRY, AMERICAN CHEMICAL SOCIETY, WASHINGTON, US, vol. 14, no. 2, March 2003 (2003-03), pages 440-448, XP002240939, ISSN: 1043-1802, the whole document

NOOREN I M A ET AL: "Structural Characterisation and Functional Significance of Transient Protein-Protein Interactions", JOURNAL OF MOLECULAR BIOLOGY, LONDON, GB, vol. 325, no. 5, 31 January 2003 (2003-01-31), pages 991-1018, XP004450001, ISSN: 0022-2836, the whole document

Form PCT/ISA/210 (continuation of second sheet) (July 1992)



INTERNATIONAL SEARCH REPORT

Application No: PCT/GB 03/04306

Patent document cited    Publication    Patent family      Publication
in search report         date           member(s)          date

WO 0068248               16-11-2000     AU 6889400 A       21-11-2000
                                        WO 0068248 A2      16-11-2000

Form PCT/ISA/210 (patent family annex) (July 1992)






